fix #18 - Add GitHub Action to enforce Apache 2.0 license headers#27
Conversation
|
@fantonangeli you must sign your commits with a key that matches your email. |
Signed-off-by: fantonangeli <fabrizio.antonangeli@gmail.com>
5dd02f0 to
40a08ea
Compare
There was a problem hiding this comment.
Pull request overview
Adds automated license-header enforcement using Apache RAT so pull requests fail if files without approved licensing are introduced, supporting the repository’s Apache 2.0 compliance goal.
Changes:
- Add a CI workflow that runs Apache RAT on pushes to
mainand on PR events. - Introduce
.rat-excludesto exclude certain non-source/config/generated files from RAT scanning. - Add Apache 2.0 license headers to
README.mdand.gitignore.
Reviewed changes
Copilot reviewed 3 out of 4 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| README.md | Adds an Apache 2.0 license header in an HTML comment. |
| .rat-excludes | Defines files to exclude from Apache RAT scanning. |
| .gitignore | Adds an Apache 2.0 license header comment block. |
| .github/workflows/ci_check_license_headers.yaml | New workflow that downloads and runs Apache RAT, failing on unapproved files. |
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
You can also share your feedback on Copilot code review. Take the survey.
| uses: actions/checkout@v3 | ||
|
|
||
| - name: Download Apache RAT | ||
| run: | | ||
| curl -LO "https://repo1.maven.org/maven2/org/apache/rat/apache-rat/${APACHE_RAT_VERSION}/apache-rat-${APACHE_RAT_VERSION}.jar" | ||
|
|
There was a problem hiding this comment.
This downloads and executes a remote JAR via curl without any integrity verification (checksum/signature) and uses an unpinned actions/checkout@v3. For supply-chain safety, pin actions to a commit SHA (or at least update to the current major) and verify the RAT artifact (e.g., download the corresponding .sha512/signature and validate before running).
| uses: actions/checkout@v3 | |
| - name: Download Apache RAT | |
| run: | | |
| curl -LO "https://repo1.maven.org/maven2/org/apache/rat/apache-rat/${APACHE_RAT_VERSION}/apache-rat-${APACHE_RAT_VERSION}.jar" | |
| uses: actions/checkout@b4ffde65f46336ab88eb53be808477a3936bae11 # actions/checkout@v4 | |
| - name: Download Apache RAT | |
| run: | | |
| set -e | |
| BASE_URL="https://repo1.maven.org/maven2/org/apache/rat/apache-rat/${APACHE_RAT_VERSION}" | |
| # Download JAR and corresponding SHA-512 checksum | |
| curl -L -o "apache-rat-${APACHE_RAT_VERSION}.jar" "${BASE_URL}/apache-rat-${APACHE_RAT_VERSION}.jar" | |
| curl -L -o "apache-rat-${APACHE_RAT_VERSION}.jar.sha512" "${BASE_URL}/apache-rat-${APACHE_RAT_VERSION}.jar.sha512" | |
| # Verify the downloaded JAR against the published checksum | |
| EXPECTED_SHA512="$(awk '{print $1}' "apache-rat-${APACHE_RAT_VERSION}.jar.sha512")" | |
| ACTUAL_SHA512="$(sha512sum "apache-rat-${APACHE_RAT_VERSION}.jar" | awk '{print $1}')" | |
| if [ "${EXPECTED_SHA512}" != "${ACTUAL_SHA512}" ]; then | |
| echo "Checksum verification FAILED for apache-rat-${APACHE_RAT_VERSION}.jar" >&2 | |
| exit 1 | |
| fi |
Signed-off-by: fantonangeli <fabrizio.antonangeli@gmail.com>
Signed-off-by: fantonangeli <fabrizio.antonangeli@gmail.com>
2947dc9 to
68eca73
Compare
Signed-off-by: fantonangeli <fabrizio.antonangeli@gmail.com>
There was a problem hiding this comment.
Pull request overview
Copilot reviewed 3 out of 4 changed files in this pull request and generated 4 comments.
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
You can also share your feedback on Copilot code review. Take the survey.
|
|
||
| # Download JAR and corresponding SHA-1 checksum | ||
| curl -LO "${BASE_URL}/${APACHE_RAT_JAR}" | ||
| curl -LO "${BASE_URL}/${APACHE_RAT_SHA}" | ||
|
|
||
| # Verify the downloaded JAR against the published checksum | ||
| EXPECTED_SHA1="$(awk '{print $1}' $APACHE_RAT_SHA)" | ||
| ACTUAL_SHA1="$(sha1sum $APACHE_RAT_JAR| awk '{print $1}')" | ||
| if [ "${EXPECTED_SHA1}" != "${ACTUAL_SHA1}" ]; then | ||
| echo "Checksum verification FAILED for ${APACHE_RAT_JAR}" >&2 | ||
| exit 1 | ||
| fi | ||
| rm $APACHE_RAT_SHA | ||
|
|
||
| - name: Run Apache RAT | ||
| run: | | ||
| APACHE_RAT_JAR="apache-rat-${APACHE_RAT_VERSION}.jar" | ||
|
|
||
| # Redirect output to .rat-reports file, continue even if RAT returns non-zero exit code because we want to print Unapproved documents | ||
| java -jar $APACHE_RAT_JAR --input-exclude-file .rat-excludes -- . > .rat-reports 2>&1 || true |
There was a problem hiding this comment.
The workflow downloads apache-rat-${APACHE_RAT_VERSION}.jar into the repository root and then runs RAT over .. This means the freshly-downloaded JAR itself is included in the scan, which can cause RAT to report it as an unapproved/binary file and fail the job. Download the JAR into a temp directory outside the scan root (e.g., $RUNNER_TEMP) or add an exclude pattern for apache-rat-*.jar and scan only tracked repo files.
| # Download JAR and corresponding SHA-1 checksum | |
| curl -LO "${BASE_URL}/${APACHE_RAT_JAR}" | |
| curl -LO "${BASE_URL}/${APACHE_RAT_SHA}" | |
| # Verify the downloaded JAR against the published checksum | |
| EXPECTED_SHA1="$(awk '{print $1}' $APACHE_RAT_SHA)" | |
| ACTUAL_SHA1="$(sha1sum $APACHE_RAT_JAR| awk '{print $1}')" | |
| if [ "${EXPECTED_SHA1}" != "${ACTUAL_SHA1}" ]; then | |
| echo "Checksum verification FAILED for ${APACHE_RAT_JAR}" >&2 | |
| exit 1 | |
| fi | |
| rm $APACHE_RAT_SHA | |
| - name: Run Apache RAT | |
| run: | | |
| APACHE_RAT_JAR="apache-rat-${APACHE_RAT_VERSION}.jar" | |
| # Redirect output to .rat-reports file, continue even if RAT returns non-zero exit code because we want to print Unapproved documents | |
| java -jar $APACHE_RAT_JAR --input-exclude-file .rat-excludes -- . > .rat-reports 2>&1 || true | |
| APACHE_RAT_DIR="${RUNNER_TEMP:-/tmp}" | |
| mkdir -p "${APACHE_RAT_DIR}" | |
| cd "${APACHE_RAT_DIR}" | |
| # Download JAR and corresponding SHA-1 checksum | |
| curl -LO "${BASE_URL}/${APACHE_RAT_JAR}" | |
| curl -LO "${BASE_URL}/${APACHE_RAT_SHA}" | |
| # Verify the downloaded JAR against the published checksum | |
| EXPECTED_SHA1="$(awk '{print $1}' "${APACHE_RAT_SHA}")" | |
| ACTUAL_SHA1="$(sha1sum "${APACHE_RAT_JAR}" | awk '{print $1}')" | |
| if [ "${EXPECTED_SHA1}" != "${ACTUAL_SHA1}" ]; then | |
| echo "Checksum verification FAILED for ${APACHE_RAT_JAR}" >&2 | |
| exit 1 | |
| fi | |
| rm "${APACHE_RAT_SHA}" | |
| - name: Run Apache RAT | |
| run: | | |
| APACHE_RAT_JAR="${RUNNER_TEMP:-/tmp}/apache-rat-${APACHE_RAT_VERSION}.jar" | |
| # Redirect output to .rat-reports file, continue even if RAT returns non-zero exit code because we want to print Unapproved documents | |
| java -jar "$APACHE_RAT_JAR" --input-exclude-file .rat-excludes -- . > .rat-reports 2>&1 || true |
Signed-off-by: fantonangeli <fabrizio.antonangeli@gmail.com>
Signed-off-by: fantonangeli <fabrizio.antonangeli@gmail.com>
…hanges#r2897289283 Signed-off-by: fantonangeli <fabrizio.antonangeli@gmail.com>
…hanges#r2897676663 Signed-off-by: fantonangeli <fabrizio.antonangeli@gmail.com>
Signed-off-by: fantonangeli <fabrizio.antonangeli@gmail.com>
There was a problem hiding this comment.
Pull request overview
Copilot reviewed 3 out of 4 changed files in this pull request and generated 3 comments.
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
You can also share your feedback on Copilot code review. Take the survey.
| - name: Cache Apache RAT | ||
| uses: actions/cache@v5 | ||
| id: cache-rat | ||
| with: | ||
| path: ${{ runner.temp }}/apache-rat-${{ env.APACHE_RAT_VERSION }}.jar | ||
| key: apache-rat-${{ env.APACHE_RAT_VERSION }} | ||
|
|
||
| - name: Download Apache RAT | ||
| if: steps.cache-rat.outputs.cache-hit != 'true' | ||
| run: | |
There was a problem hiding this comment.
The RAT JAR’s checksum is only verified on fresh download. When the cache is hit, the workflow uses the cached JAR without any integrity check, so a corrupted/poisoned cache entry could go undetected. Consider always verifying the checksum (even on cache-hit) or incorporate the expected digest into the cache key and validate the restored file before running RAT.
There was a problem hiding this comment.
I would skip this comment @ricardozanini
Closes #18
Summary
Add a GitHub Actions workflow that validates Apache 2.0 license headers are present in all applicable source files and fails the build if headers are missing. The check must execute on every Pull Request.
Goals
Ensure all source files comply with Apache 2.0 licensing requirements.
Non-Goals